Distribution-agnostic Linear Unbiased Estimation with Saturated Weights for Heterogeneous Data

نویسندگان

چکیده

The challenging problem of distribution-agnostic linear (weighted) unbiased estimation a global parameter from heterogeneous and unbalanced data is addressed. This setup may originate in different signal processing contexts involving the joint non-homogeneous groups whose statistical distribution unknown, with (possibly highly) diverse sample sizes. Since estimators local variances are inaccurate low-sample regime, suitable weighting schemes required. For this problem, we study family based on idea trimmed weights, i.e., proportional to size but proper saturation. Such an approach theoretically analyzed, showing that it can be linked Maximum Entropy principle under uncertainty generative model (as well as broader class cost functions). Different criteria for setting “cut-off” threshold between saturated regions also obtaining reduced-complexity approximation optimal minimum-variance estimator generalized mixed-effect model. To aim, further contribution several hyperparameter derived analyzed. proposed analyzed its performance assessed against state-of-the-art estimators. An illustrative application real-world COVID-19 finally developed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Best Linear Unbiased Estimation in Linear Models

where X is a known n × p model matrix, the vector y is an observable ndimensional random vector, β is a p × 1 vector of unknown parameters, and ε is an unobservable vector of random errors with expectation E(ε) = 0, and covariance matrix cov(ε) = σV, where σ > 0 is an unknown constant. The nonnegative definite (possibly singular) matrix V is known. In our considerations σ has no role and hence ...

متن کامل

Unbiased bootstrap error estimation for linear discriminant analysis

Convex bootstrap error estimation is a popular tool for classifier error estimation in gene expression studies. A basic question is how to determine the weight for the convex combination between the basic bootstrap estimator and the resubstitution estimator such that the resulting estimator is unbiased at finite sample sizes. The well-known 0.632 bootstrap error estimator uses asymptotic argume...

متن کامل

Distribution-Specific Agnostic Boosting

We consider the problem of boosting the accuracy of weak learning algorithms in the agnostic learning framework of Haussler (1992) and Kearns et al. (1992). Known algorithms for this problem (BenDavid et al., 2001; Gavinsky, 2002; Kalai et al. , 2008) follow the same strategy as boosting algorithms in the PAC model: the weak learner is executed on the same target function but over different dis...

متن کامل

Inverse DEA Model with Fuzzy Data for Output Estimation

In this paper, we show that inverse Data Envelopment Analysis (DEA) models can be used to estimate output with fuzzy data for a Decision Making Unit (DMU) when some or all inputs are increased and deficiency level of the unit remains unchanged.

متن کامل

Learning classifiers with ternary weights Learning linear classifiers with ternary weights from Metagenomic Data

Motivated by recent researches in metagenomic classification tasks, this paper investigates the problem of finding interpretable concepts from training data in which the number of features is larger than the number of samples. In this setting, the classification problem is modeled as a combinatorial optimization problem, in which the aim of the learner is to find a {−1, 0, +1}-weighted linear t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Signal Processing

سال: 2023

ISSN: ['1053-587X', '1941-0476']

DOI: https://doi.org/10.1109/tsp.2023.3293908